Machine Learning Methods for Estimating Heterogeneous Causal Effects
نویسنده
چکیده
In this paper we propose methods for estimating heterogeneity in causal effects in experimental and observational studies, and for conducting inference about the magnitude of the differences in treatment effects across subsets of the population. In applications, our method provides a data-driven approach to determine which subpopulations have large or small treatment effects and to test hypotheses about the differences in these effects. In most of the literature on supervised machine learning (e.g. regression trees, random forests, LASSO, etc.), the goal is to build a model of the relationship between a unit’s attributes and an observed outcome. Cross-validation plays a prominent role: the methods compare predictions to actual outcomes in test samples, in order to select the level of complexity of the model that provides the best out-of-sample predictive power. Our method is closely related, but it differs in that it is tailored for predicting causal effects of a treatment rather than a unit’s outcome. The challenge is that the “ground truth” for a causal effect is not observed for any individual unit: we observe the unit with the treatment, or without the treatment, but not both at the same time, and so new approaches are required to use cross-validation to determine whether a causal effect has been accurately predicted. We propose novel cross-validation criteria for causal effects and demonstrate through simulations the conditions under which they perform better than directly applying standard prediction methods.
منابع مشابه
Facilitating score and causal inference trees for large observational studies
Assessing treatment effects in observational studies is a multifaceted problem that not only involves heterogeneous mechanisms of how the treatment or cause is exposed to subjects, known as propensity, but also differential causal effects across sub-populations. We introduce a concept termed the facilitating score to account for both the confounding and interacting impacts of covariates on the ...
متن کاملCausal Structure Learning and Inference: A Selective Review
In this paper we give a review of recent causal inference methods. First, we discuss methods for causal structure learning from observational data when confounders are not present and have a close look at methods for exact identifiability. We then turn to methods which allow for a mix of observational and interventional data, where we also touch on active learning strategies. We also discuss me...
متن کاملEstimating Heterogeneous Treatment Effects and the Effects of Heterogeneous Treatments with Ensemble Methods
Randomized experiments are increasingly used to study political phenomena because they can credibly estimate the average effect of a treatment on a population of interest. But political scientists are often interested in how effects vary across sub-populations— heterogeneous treatment effects —and how differences in the content of the treatment affects responses—the response to heterogeneous tr...
متن کاملLearning Instrumental Variables with Structural and Non-Gaussianity Assumptions
Learning a causal effect from observational data requires strong assumptions. One possibility is to use instrumental variables, which are typically justified by background knowledge. It is possible, under further assumptions, to discover whether a variable is structurally instrumental to a target causal effect X → Y . However, the few existing approaches are lacking on how general these assumpt...
متن کاملDensity Ratio Estimation: A Comprehensive Review
Density ratio estimation has attracted a great deal of attention in the statistics and machine learning communities since it can be used for solving various statistical data processing tasks such as non-stationarity adaptation, two-sample test, outlier detection, independence test, feature selection/extraction, independent component analysis, causal inference, and conditional probability estima...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015